The information capacity of a single photon 
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Quantum states of light are the obvious choice for communicating quantum information. To date, encoding 
information into the polarisation states of single photons has been widely used as these states form an natural 
closed two state qubit. However, photons are able to encode much more - in principle infinite - information 
via the continuous spatio-temporal degrees of freedom. Here we consider the information capacity of an optical 
quantum channel, such as an optical fibre, where a spectrally encoded single photon is the means of commu- 
nication. We use the Holevo bound to calculate an upper bound on the channel capacity, and relate this to the 
spectral encoding basis and the spectral properties of the channel. Further, we derive analytic bounds on the 
capacity of such channels, and in the case of a symmetric two-state encoding calculate the exact capacity of the 
corresponding channel. 



Introduction — Single photons are an ideal candidate for 
efficiently communicating both quantum and classical infor- 
mation (2. Unlike many other quantum systems, photons are 
inherently 'flying', making them ideal for quantum commu- 
nication tasks, including quantum key distribution and dis- 
tributed quantum computation. In optical quantum informa- 
tion processing El [3) it is common to encode a qubit into 
the polarisation of a single photon. That is, a qubit is defined 
as a\H) + P\V), or a classical bit can be communicated by 
choosing \H) or \V). Alternately, an encoding could be per- 
formed in the photon-number or quadrature bases. These cases 
have been studied extensively by previous authors RUlH. For 
example, the Fock basis, {|n)}, could be employed to encode 
an alphabet with a number of letters limited only by energy 
constraints. While the alphabet may in principle be arbitrarily 
large, once loss is introduced or physically realistic encod- 
ing procedures and photo-detectors are employed, which in- 
troduces mixing in the photon number degree of freedom, the 
information capacity is limited. 

In this Letter we approach photonic information capacity 
from an entirely different perspective. We fix the number of 
photons at n = 1, and encode information into its spectral de- 
gree of freedom lfT2ll . Since the spectral degree of freedom 
is continuous, in principle infinite information could be trans- 
mitted by a single photon encoded in this basis. However, sub- 
ject to realistic channel, detector and photon engineering con- 
straints, the communicable information is reduced. We exam- 
ine the information capacity of a single photon via encoding 
in the spectral domain and derive bounds on the channel ca- 
pacity using such an encoded photon under realistic assump- 
tions about the communications channel and photo-detector. 
We relate the channel capacity to the spectral response of the 
channel and photo-detector, and the choice of spectral encod- 
ing basis. 

The spectral structure of photons — A photon can be ex- 
pressed as a superposition of different spectral components, 
allowing an Af-level qudit to be encoded, where N can in 
principle be arbitrarily large. To perform such encoding we 
choose a set of spectral functions {^(cj)}, where uo is fre- 
quency relative to a central carrier frequency. Ideally we 



would like these functions to form an orthonormal basis, 
X^o ^ii^^ji^T da; = ^ j, such that they can always be 
perfectly distinguished with an appropriate measurement de- 
vice. In reality, however, orthogonality might only be approx- 
imate. We define photonic mode operators 031 which cre- 
ate photons with a well defined spectral distribution function 
= X^L ^(00)0) (uu) duo, where a*(uj) is the sin- 
gle frequency photonic creation operator in spatial mode a. 
We will employ the shorthand , JO) = \i), where |0) is 
the vacuum state. Then a spectral basis state may be expressed 
™pi = Al i{u) \0)(0\A Mu) = \i)(i\. 

In principle, any basis {ipi(oj)} could be chosen, such as 
frequency or temporal delta functions, wavelet families, Her- 
mite polynomials, or any other set of functions satisfying or- 
thonormality. However, photon engineering fT2l [1414211 is an 
emerging field and not all states can be readily prepared on- 
demand with sufficient fidelity. 

Information capacity of a single photon — Let Alice en- 
code classical information by choosing i in the range (1, A 7 "). 
Thus, a single photon sends a letter from an A" letter alpha- 
bet. Next we propagate the spectrally encoded state through 
a channel (such as an optical fibre), which has a frequency- 
dependent transmission function rj(u). That is, the channel 
has probability t](uj) 2 of propagating a photon of frequency 
u, otherwise it is absorbed by the channel. Here we will as- 
sume that the channel is Markovian, as this assumption holds 
true for most physical mechanisms inducing photon loss, and 
hence we are free to model the channel as a frequency de- 
pendent beamsplitter l22l . where the reflected component is 
traced out. The spectral response of a photo-detector after the 
channel can be merged with the spectral response of the chan- 
nel, T](UJ) = Channel M ^detector M- 

When a spectrally encoded basis state passes through this 
channel, the output state is (after tracing over the envi- 
ronment), p\ = ^ (w) ^ (w) |0)(0|^ (aj)v ,. (w) +€i|0) (0|, where 

e % — |\A — 7 ?( CJ ) 2 V ; i( a; )| 2 do;. Thus, after the channel 
we have a mixture of the vacuum state (corresponding to the 
absorbed component), and a single photon with spectral dis- 
tribution function modulated by the spectral response of the 
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FIG. 1: (Colour online) Example choice of encoding basis 
where the basis states are not perfectly orthogonal, and an exam- 
ple channel spectral response function tj(uj). The functions {ipi(uj)} 
are identical Gaussians of different means, separated in frequency by 
multiples of Au. By letting the Gaussians have zero variance, one 
could achieve orthonormality and encode a qudit with an arbitrary 
number of levels. However, this would be physically unrealistic. In 
this example we have centred uo around 0, which is unphysical. Thus, 
uj should be interpreted as being relative to some central carrier fre- 
quency. All the integral overlaps defining the channel capacity are 
invariant under uniform translations in oj, so the choice of centre fre- 
quency does not affect the results. Other examples of encoding bases 
are time-bin encoding, which would yield the same results in the 
conjugate domain, or orbital angular momentum encoding. 

channel. Note that ^^^-(u) 1^) * s m g enera l not normalised. 

We wish to establish how much information Alice is able 
to communicate to Bob using her spectrally encoded sin- 
gle photon state across the channel. Because the spectral re- 
sponse of the channel modulates the spectral basis states, in 
general the optimal choice of measurement basis for Bob 
will not be the same as Alice's encoding basis. To place 
an upper bound on the mutual information between Alice 
and Bob, we calculate the Holevo bound (T] [23), which 
bounds the mutual information under any choice of measure- 
ment basis for Bob. Formally, the mutual information be- 
tween Alice and Bob is bounded by the Holevo quantity as 
H(A : B) < S(p') - ZiPiStfi), where pf = ZiPiPi and 
Pi is the a priori probability that basis state i will be transmit- 
ted. We emphasise that the Holevo bound is merely an upper 
bound on the mutual information, which, in general, cannot 
always be saturated. 

The mixture observed by Bob is 
P' = ZiPi(Al iu)Moj) \0)(0\A v{oj)Moj) +e i |0)<0|). The 
terms in this mixture have been modulated and are in general 
no longer orthonormal. We will re-express the output state in 
some orthonormal basis {^(u)}, 

p' = E^E( A ? )A 5 r ^Mi°)< i^H+^io)<oi) 

*=i 3d' 

= E y «'li)0"l+e|0)(0|, (1) 
3d' 

where = (/>j((j)^i(uj)*rj(uj) du, 



Y 3d' = Y^iLlPi^^ > and e = Yn=iPi e i- Then ^ 
can be calculated that 

S(p'i) = -€i \og 2 €i - (1 - ei)log 2 (l - e»), 

N 

s(p') = -eiog 2 e-^y;io g2 y;, (2) 

where Y- is the jth eigenvalue of Y. Thus, the Holevo bound 
is 

H(A : B) < -elog 2 e - ^F/log.F/ 
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N 

+ ^2Pi^i^g 2 €i + (1 - ei)log 2 (l - €i)]. (3) 

i=l 

The Holevo bound is maximised by optimising over p i9 which 
may be prohibitive for large N. 

Classical channel capacity — In photonic quantum com- 
putation O it is common to accommodate for lossy chan- 
nels via post-selection. That is, we discard events where the 
wrong number of photons are measured due to photon loss. 
In the case of a communications channel, both post-selected 
and non-post-selected scenarios are useful, and we will con- 
sider these two scenarios separately as they are suited to in- 
herently different situations. First, note that if a photon is sent, 
detection of "no photon" actually contains information about 
the encoded state, and it is therefore in general sub-optimal 
to post-select out such events. Specifically, the loss of a pho- 
ton gives us information that the encoded basis state was more 
likely to be in the region where tj(uj) is low. For example, con- 
sider Fig.[T] In this example, if no photon is detected it is more 
likely that Alice's encoded letter was ipi(u) or ^5 (a;) than 
ip3 (uj). Thus, photon loss communicates information from Al- 
ice to Bob, which would be discarded if post-selection were 
introduced into the protocol. 

In the case of a constant bit-rate communications channel, 
where photons are being transmitted at predictable regular in- 
tervals, this observation leads us to conclude that it is best 
not to post-select and instead interpret photon loss as a legiti- 
mate signal. In this case, the classical channel capacity is sim- 
ply C < H(A : B). On the other hand, with a variable bit-rate 
channel, there is no way of knowing whether measurement of 
the vacuum state corresponded to photon loss or simply lack 
of a transmission. In this case post-selection is necessary and 
C < (1 — e)5(pp S ), where pp S is p' post-selected on there be- 
ing a photon. 

Numeric results — We now calculate an upper bound on 
the capacity of the channel in a specific subset of encodings, 
as quantified by the Holevo bound, before going on to derive 
general bounds on the capacity of such channels later in this 
Letter. We consider a spectral basis comprised of fixed- width, 
displaced Gaussians, oc g-C^-O'- 1 )^) / 4 ? each offset 

by Acu from the next, as shown in Fig. [T] [26]. Optimising 
H (A : B) over pi for large N is prohibitive, so in our numeric 
analyses we will make the simplifying assumption that Alice 
is employing a uniform encoding, pi = 1/N l27l . 
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FIG. 2: (Colour online) Upper bound on the classical channel capacity (bits per transmitted photon) using Gaussian wave-packets, displaced by 
multiples of Auj, where Alice employs uniform encoding, pi = 1/N. (left) Flat spectral response channel (efficiency 77), with or without post- 
selection, (middle) Gaussian spectral response channel (bandwidth a), without post-selection, (right) Gaussian spectral response channel, with 
post-selection. We encode across N = 32 basis states, a maximum of log 2 N = 5 bits of information. In the limit of Auj ^> 1 (distinguishable 
photons), and a ^> 1 or 77 = 1 (a perfect channel), the upper bound on C asymptotes to the maximum achievable log 2 iV. In this limit C 
saturates the bound. 



We consider two situations. First, when the channel spectral 
response function is a constant, rj(u) = 77, and second, when 
the channel spectral response function is Gaussian [28) with 
standard deviation a, rj(u) = e~^l^ / 4 . The upper bound on 
C is plotted in Fig.[2| where we encode across TV = 32 spectral 
basis states, for a maximum of \og 2 N = 5 bits of information. 

For a flat spectral response function we observe a mono- 
tonic increase in C as both the wave-packet separation (i.e. 
photon distinguishability) and channel efficiency increase. In 
the limit of large Auj and 77 = 1, we observe the upper bound 
is the maximum achievable C < log 2 7V bits. It is obvious that 
C must increase monotonically with Auj, since the extractable 
information to Bob will depend on how well he can distin- 
guish the different basis states. For sub-unit efficiency, the ba- 
sis states become mixed with the vacuum state, which dimin- 
ishes their distinguishability, thus C must drop against loss. 
Note that for flat channel spectral response there is no bit-rate 
difference between the post-selected and non-post-selected 
cases. This is because the channel introduces no bias which 
enables the vacuum state to convey information about the en- 
coded state. Thus, it makes no difference if it is post-selected 
away. 

In the case of a Gaussian spectral response function, with 
perfect efficiency at 77(0) = 1, we observe that as the standard 
deviation of the spectral response function increases, so does 
C. In the limit of large a, the spectral response becomes flat 
with unity efficiency, 77 (uo) —> 1, and with large Auj we find 
the upper bound on the channel capacity is C < log 2 7V, the 
theoretical maximum. In this limit Bob always observes the 
same state Alice transmitted, a basis of orthogonal pure states, 
and C saturates the bound when frequency-resolved photo- 
detection is employed by Bob. 

For indistinguishable photons, no measurement performed 
by Bob is able to discern which encoded basis state is be- 
ing transmitted, and thus the information capacity is zero for 
Auj = 0. Similarly, for a very narrow channel spectral re- 
sponse function, Bob always measures the vacuum state and 
no information may be communicated. 

As expected, for Gaussian response, the post-selected chan- 
nel bandwidth is strictly less than the non-post-selected band- 



width, owing to the information which is discarded during 
post-selection. 

With a finite bandwidth channel there reaches a point where 
adding more basis states to the alphabet will not enhance the 
information capacity of the channel, since the additional let- 
ters reside in the region where t](uj) « 0. On the contrary, it 
becomes counterproductive to employ additional letters since 
we are shifting the probability distribution within p' into a 
region where no information may be communicated. In Fig. 
[3] we illustrate the relationship between the number of basis 
states, channel bandwidth, and C. Evidently, for a given chan- 
nel there is always a finite optimal value for N, shown by 
the red line. Thus, in general it is not optimal to always en- 
code across the largest possible alphabet. Rather, the optimal 
alphabet size is a function of the channel spectral response. 

2000 








FIG. 3: (Colour online) Upper bound on the classical channel capac- 
ity (without post- selection) as a function of the alphabet size, N, and 
channel bandwidth, a. The red line shows the optimal N for a given 
a. Note that the optimal encoding is calculated based on the Holevo 
bound, which only provides an upper bound on C, which may not 
necessarily be saturated. Thus, the true optimal encoding may be dif- 
ferent. 

Analytic bounds — We have previously considered the 
Holevo bound as a means for determining an upper bound 
on the channel capacity using a single photon. In certain cir- 
cumstances, however, there is another option for bounding the 
capacity, which we now describe. Consider a photon encoded 
as ipi(u) ex e _ ( d i- w ) /( 4cr ^), which passes through a channel 
with t)(uj) = ^Jp^^e~ UJ /( 4 %), a Gaussian scaled such that 
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the peak transmission probability is p pea k- 

The probability of the photon passing through the channel 
without being lost is qi = ^(cj) 2 ^ (uj) 2 duj, which in this 
case yields 



Ppeak 



exp 



d 2 



2(^ + ^) 



(4) 



and the state of the photon, if it is transmitted, has spectral 
distribution function ^-(cj) = -^=77(0;)^ (a;). 

Note that the minimum probability of photon loss obtained 
by maximising qi over d for fixed is 



Ppeak 



0> V G 



(5) 



" 0V, 



and hence an upper bound on the channel capacity is given by 
the channel capacity of a quantum erasure channel with era- 
sure probability k = 1 — q max . This channel has been studied 
extensively for the case of qubits, for which analytic results 
are known for both the quantum and classical capacities l24l . 
These arguments can trivially be extended to qudits to give 
the classical channel capacity (C), quantum capacity (Q), and 
quantum capacity assisted by two-way classical communica- 
tion (Q2), all of which are equal to (1 — k) log 2 N for the 
quantum erasure channel. 

Thus, for the case of information encoded as a photon pass- 
ing through a channel with a Gaussian transmission profile, 
where each letter is encoded as a Gaussian distribution over 
frequencies, all three capacities are bounded from above by 



C, Q, S2 < 



PpeaklQg2^ 



(6) 



In the alternate case of constant transmission probability, 
r] (lj) = 77, as the overlap between the Gaussians encod- 
ing different letters tends to zero the channel approaches 
a quantum erasure channel and the capacities tend to 
C, Q, Q2 = r] 2 \og 2 N from below. 

Calculating exact channel capacities can be challenging. 
However, in the restricted case of two- state Gaussian encod- 
ing, it can be shown that the classical channel capacity is given 
exactly by, 



^ — Ppeak 



3 8(1 + A 2 ) 



( 



3 2 (1+A 2 ) \ 



V 



(7) 
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where S = Au/a v and A = a^/a v . Fig. [4] shows the maxi- 
mum value this capacity can take as a function of A. The proof 
is given in the Supplementary Information. 

Conclusion — We have discussed the scenario where a sin- 
gle photon, encoded in the spectral degree of freedom, com- 
municates information between two parties using a channel 
with well-defined spectral response. We analytically derived 




FIG. 4: Maximum classical channel capacity for two-state Gaussian 
encoding as a function of the ratio A = a^p/a^. 



upper bounds on the achievable classical and quantum chan- 
nel capacities under a physically realistic model for the com- 
munications channel - specifically, a channel with frequency- 
dependent absorption properties. 

We noted that, in general, post- selection upon detection of 
a photon is sub-optimal, since measuring the vacuum state ac- 
tually carries information about the encoded state sent by the 
transmitter. However, depending on the application, specif- 
ically in the context of a variable bit-rate channel, post- 
selection may be necessary. We calculated an upper bound 
on the classical channel capacity in the cases of both post- 
selected and non-post-selected communication, and on the 
quantum capacity in the case of Gaussian encoding. 

We argued that, in general, it is not optimal to always en- 
code across the largest possible alphabet. Rather, the optimal 
alphabet size will be a function of the channel spectral re- 
sponse. 
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Supplementary information 

We now derive an analytic bound on the classical chan- 
nel capacity in the restricted case of two-state encoding using 



Gaussian wave-packets. Consider a photon encoded as 
1 



-(d,-u,)7(4<4) 



/27TCr 



which passes through a channel with 

77H = v^e- 2 /(4^) 5 



(8) 



(9) 



a Gaussian scaled such that the peak transmission probability 

IS Ppeak- 

The probability of the photon passing through the channel 
without being lost is 



Ppeak 



=exp 



2(^ + ^) 



, (10) 



and the state of the photon, if it is transmitted, has spectral 
distribution function given by ^[(uo) = ■^=rj(u) , ipi(uj). 

In general, the task of calculating the exact capac- 
ity of a communications channel is challenging, and has 
not been possible for most channels of interest. However, 
for the case of information encoded in one of two pure 

states, the classical capacity is known to be I2 ^ 1 ~ sl 2 n( ^ ^ 

l25ll , where the overlap of the two states is cos (a), and 
l2(x) = 1 + x\og 2 x + (1 — x) log 2 (l — x). Thus we have 

If we consider a two letter alphabet encoded by two Gaus- 
sians of standard deviation and separation between centres 
of Auj, then their overlap is e~ AuJ ^ 8a ^\ and hence the clas- 
sical capacity of the corresponding lossless channel will be 
given exactly by 



C = h 



-Au;V(4aJ) 



(ID 



If we consider the state after propagating through a channel 
t](uj) with a Gaussian transmission profile, as described ear- 
lier, then the output state for each encoded letter (if the pho- 
ton is not absorbed) will be given by ^[(uo) = ^=7/(0;)^ (lj). 
Since this is proportional to the product of two Gaussians, the 
result will be another Gaussian wave-packet with 



and di 



-(d tv -uj) 2 /(4al r] ) 



d % 



(12) 



where a^n - 

In order to ensure that the encoding works well in the post- 
selection regime, we ensure that detection or non-detection 
of a photon reveals no information about which letter is 
encoded by making the assumption that do = — Auj/ 2 and 



d\ = Auj/2. The overlap between i(j f and is then given 
by e -* 2 (i+A 2 )/(8A 2 ) ? where S = Au/a^ and A = a^/a^. The 
classical capacity of this channel is thus 



C = qoh 



Si 



<5 2 (1 + A2) \ 



8A 2 



Maximising this value over 5 we obtain the exact maximum 
channel capacity as a function of p pea k and A. 



